Formant Estimation using an Autoassociative Neural Network
نویسندگان
چکیده
Building on algorithms developed in earlier work (Hawkins et al., 1994a, 1994b; Hawkins, 1997; Hawkins et al., 2002), this study develops a new technique for improving the accuracy of formant estimates produced by an analysis-by-synthesis formant tracker (DPTRAK, Clermont, 1992). DPTRAK is evaluated by comparing its formant estimates against those obtained manually by the first author when he inspected the spectrogram of each vowel produced by each speaker. Applied to 13 male speakers uttering the 11 monophthongs and eight diphthongs of Australian English, DPTRAK produced results that varied in accuracy across speakers. The percentage of speech frames tracked accurately varied from 99% for the best speaker through to 58% for the worst speaker. We develop the SpeechSifter algorithm to sift through the speech frames tracked by the DPTRAK formant tracker (or any other formant tracker) and select only those frames that are likely to be accurately tracked. This unsupervised algorithm first selects the ideal speaker on which to train a Replicator Neural Net (Hawkins et al., 2002). The trained Replicator Neural Net is then used to screen those speech frames on which the formant tracker is highly likely to have made accurate formant estimates and to discard the rest. We demonstrate the value of this approach. First, we demonstrate that we can accurately predict which speaker will provide the ideal training speaker for the RNN. Next, we apply the trained RNN to a speaker and show that that it is possible to achieve a 90% accuracy rate whilst retaining 75% of the speaker’s original speech frames. This is an improvement on the DPTRAK algorithm which achieves an accuracy rate of only 81% for this speaker.
منابع مشابه
Formant Estimation and Tracking Using Deep Learning
Formant frequency estimation and tracking are among the most fundamental problems in speech processing. In the former task the input is a stationary speech segment such as the middle part of a vowel and the goal is to estimate the formant frequencies, whereas in the latter task the input is a series of speech frames and the goal is to track the trajectory of the formant frequencies throughout t...
متن کاملEstimation of Industrial Production Costs, Using Regression Analysis, Neural Networks or Hybrid Neural - Regression Method?
Estimation (Forecasting) of industrial production costs is one of the most important factor affecting decisions in the highly competitive markets. Thus, accuracy of the estimation is highly desirable. Hibrid Regression Neural Network is an approach proposed in this paper to obtain better fitness in comparison with Regression Analysis and the Neural Network methods. Comparing the estimated resul...
متن کاملEstimation of Cadmium and Uranium in a stream sediment from Eshtehard region in Iran using an Artificial Neural Network
Considering the importance of Cd and U as pollutants of the environment, this study aims to predict the concentrations of these elements in a stream sediment from the Eshtehard region in Iran by means of a developed artificial neural network (ANN) model. The forward selection (FS) method is used to select the input variables and develop hybrid models by ANN. From 45 input candidates, 13 and 14 ...
متن کاملError Modeling in Distribution Network State Estimation Using RBF-Based Artificial Neural Network
State estimation is essential to access observable network models for online monitoring and analyzing of power systems. Due to the integration of distributed energy resources and new technologies, state estimation in distribution systems would be necessary. However, accurate input data are essential for an accurate estimation along with knowledge on the possible correlation between the real and...
متن کاملEstimation of Reference Evapotranspiration Using Artificial Neural Network Models and the Hybrid Wavelet Neural Network
Estimation of evapotranspiration is essential for planning, designing and managing irrigation and drainage schemes, as well as water resources management. In this research, artificial neural networks, neural network wavelet model, multivariate regression and Hargreaves' empirical method were used to estimate reference evapotranspiration in order to determine the best model in terms of efficienc...
متن کامل